Graph Tree Decomposition Based Fast Peptide Sequencing and Spectral Alignment

نویسندگان

  • Chunmei Liu
  • Yinglei Song
  • Bo Yan
  • Ying Xu
  • Liming Cai
چکیده

De novo sequencing and spectral alignment are computationally important for the prediction of new peptides via tandem mass spectrometry (MS/MS). Both approaches are established upon the technique of finding the longest antisymmetric path on formulated graphs. The task is often complicated and the prediction accuracy is compromised when given spectra involve noise data, missing mass peaks, or post translational modifications/mutations. This paper introduces a graphical mechanism to describe relationships among mass peaks that, through 1 The preliminary version of this paper appeared in the proceedings of the 11th Pacific Symposium on Biocomputing (PSB2006). * Corresponding Author. Email: [email protected]. International Journal of Computational Science 1992-6669 (Print) 1992-6677 (Online) www.gip.hk/ijcs © 2008 Global Information Publisher (H.K) Co., Ltd. All rights reserved. Graph Tree Decomposition Based Fast Peptide Sequencing and Spectral Alignment GLOBAL INFORMATION PUBLISHER 1 graph tree decomposition, yields linear time and quadratic algorithms for optimal de novo sequencing and spectral alignment respectively. Our test results show that, in addition to high efficiency, the new algorithms can achieve desired prediction accuracy on spectra containing noise peaks and post translational modifications (PTMs) while allowing the presence of both b-ions and y-ions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast De novo Peptide Sequencing and Spectral Alignment via Tree Decomposition

De novo sequencing and spectral alignment are computationally important for the prediction of new protein peptides via tandem mass spectrometry (MS/MS). Both approaches are established upon the problem of finding the longest antisymmetric path on formulated graphs. The problem is of high computational complexity and the prediction accuracy is compromised when given spectra involve noisy data, m...

متن کامل

Fast and accurate search for non-coding RNA pseudoknot structures in genomes

MOTIVATION Searching genomes for non-coding RNAs (ncRNAs) by their secondary structure has become an important goal for bioinformatics. For pseudoknot-free structures, ncRNA search can be effective based on the covariance model and CYK-type dynamic programming. However, the computational difficulty in aligning an RNA sequence to a pseudoknot has prohibited fast and accurate search of arbitrary ...

متن کامل

gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...

متن کامل

Efficient Parameterized Algorithm for Biopolymer Structure-Sequence Alignment

Computational alignment of a biopolymer sequence (e.g., an RNA or a protein) to a structure is an effective approach to predict and search for the structure of new sequences. To identify the structure of remote homologs, the structure-sequence alignment has to consider not only sequence similarity but also spatially conserved conformations caused by residue interactions, and consequently is com...

متن کامل

Cross-Lingual Word Representations via Spectral Graph Embeddings

Cross-lingual word embeddings are used for cross-lingual information retrieval or domain adaptations. In this paper, we extend Eigenwords, spectral monolingual word embeddings based on canonical correlation analysis (CCA), to crosslingual settings with sentence-alignment. For incorporating cross-lingual information, CCA is replaced with its generalization based on the spectral graph embeddings....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008